Overview of the CLEF-2006 Cross-Language Speech Retrieval Track

نویسندگان

  • Douglas W. Oard
  • Jianqiang Wang
  • Gareth J. F. Jones
  • Ryen W. White
  • Pavel Pecina
  • Dagobert Soergel
  • Xiaoli Huang
  • Izhak Shafran
چکیده

The CLEF-2006 Cross-Language Speech Retrieval (CLSR) track included two tasks: to identify topically coherent segments of English interviews in a known-boundary condition, and to identify time stamps marking the beginning of topically relevant passages in Czech interviews in an unknown-boundary condition. Five teams participated in the English evaluation, performing both monolingual and cross-language searches of speech recognition transcripts, automatically generated metadata, and manually generated metadata. Results indicate that the 2006 English evaluation topics are more challenging than those used in 2005, but that crosslanguage searching continued to pose no unusual challenges when compared with monolingual searches of the same collection. Three teams participated in the monolingual Czech evaluation using a new evaluation measure based on differences between system-suggested and ground truth replay start times, with results that were broadly comparable to those observed for English.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Overview of the CLEF-2005 Cross-Language Speech Retrieval Track

The task for the CLEF-2005 cross-language speech retrieval track was to identify topically coherent segments of English interviews in a known-boundary condition. Seven teams participated, performing both monolingual and cross-language searches of ASR transcripts, automatically generated metadata, and manually generated metadata. Results indicate that monolingual search technology is sufficientl...

متن کامل

Overview of the CLEF-2007 Cross Language Speech Retrieval Track

The CLEF-2007 Cross-Language Speech Retrieval (CL-SR) track included two tasks: to identify topically coherent segments of English interviews in a known-boundary condition, and to identify time stamps marking the beginning of topically relevant passages in Czech interviews in an unknown-boundary condition. Six teams participated in the English evaluation, performing both monolingual and cross-l...

متن کامل

Speech Retrieval Experiments using XML Information Retrieval

This report presents the University of Twente’s first cross-language speech retrieval experiments in Cross-Language Evaluation Forum (CLEF). It describes the issues our contribution was focusing on, it describes the PF/Tijah XML Information Retrieval system that was used and it discusses the results for both the monolingual English and the Dutch-English crosslanguage spoken document retrieval (...

متن کامل

Domain-Specific Track CLEF 2006: Overview of Results and Approaches, Remarks on the Assessment Analysis

The CLEF domain-specific track uses databases in different languages from the social science domain as the basis for retrieval of documents relevant to a user’s query. Predefined topics simulate the user’s information needs and are used by the research groups participating in this track to generate actual queries. The documents are structured into individual elements so that they can be used fo...

متن کامل

The CLEF 2003 Cross-Language Spoken Document Retrieval Track

The current expansion in collections of natural language based digital documents in various media and languages is creating challenging opportunities for automatically accessing the information contained in these documents. This paper describes the CLEF 2003 track investigation of Cross-Language Spoken Document Retrieval (CLSDR) combining information retrieval, cross-language translation and sp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006